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Abstract. Two classical stochastic processes are considered, the Ehrenfest 
process, introduced in 1907 in the kinetic theory of gases to describe the heat 
exchange between two bodies and the Engset process, one of the early (1918) 
stochastic models of communication networks. This paper investigates the 
asymptotic behavior of the distributions of hitting times of these two processes 
when the number of particles / sources goes to infinity. Results concerning the 
hitting times of boundaries in particular are obtained. The paper relies on 
martingale methods, a key ingredient is an important family of simple non- 
negative martingales, an analogue, for the Ehrenfest process, of the exponential 
martingales used in the study of random walks or of Brownian motion. 



A la memoire de Philippe Flajolet. 
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1. Introduction 

The Ehrenfest Process. In this paper one considers the following continuous 
time version of the classical Ehrenfest urn model. This process has been introduced 
to study the heat exchange between bodies. One assumes that each particle of a 
set of N particles is located in one of two boxes (bodies), and 1 say. A particle in 
box [resp. 1] goes into box 1 [resp. 0] at rate v [resp. fi\. One denotes as Ejf(t) 
the number of particles in box 1 at time t > 0. This birth and death process can 

also be represented as E N (t) = lj.(t) H h Y N (t), where (Yi(t), 1 < i < N) are N 

i.i.d. Markov jump processes with values in {0, 1}. 

Originally, the model is a discrete time process (Z/v(fc)) and each unit of time a 
particle is taken at random (i.e. equally likely) to be moved from one box to the 
other, Zjv(fc) is the number of particles in box 1 at time k. This corresponds to the 
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symmetrical case [i — v and, clearly, Exit) can be represented as Zn{Mn^{{Q, t])) 
if A/jv> is a Poisson process with rate Nfi. The process (-E/v(i)) follows the same 
path as (Zis[(k)) but on a time scale with a factor N/j,. The Markov chain (Zjv(fc)) 
is also random walk on the graph of the hypercube {0, 1}^ where edges connect 
elements that differ at only one coordinate. The equilibrium properties of (Eiy(t)) 
and (^jv(fc)) are fairly well known, in particular a quite precise estimate of the 
duration of time to reach equilibrium is available. See Diaconis et al. [3] . 

Results on transient quantities of this process, like the distribution of the first 
time when the box is empty, are more rare. There are generic results on birth 
and death process which describe some of these distributions in terms of spectral 
characteristics of the associated infinitesimal generator: the spectral measure and 
a family of orthogonal polynomials. See Karlin and McGregor pTJ [12] and Keil- 
son |14j . In practice, the corresponding orthogonal polynomials and, sometimes, 
their spectral measure do not have a simple representation, this complicates sig- 
nificantly detailed investigations of these hitting times. In the symmetrical case 
fx=v, Bingham pQ and Flajolet and Huillet [7] give a representation of their distri- 
butions. In the general case Crescenzo [2] and Flegg et al. [9] provide expressions 
of the densities. It turns out that the expressions obtained in these papers involve, 
in general, sums of combinatorial terms for which asymptotic results (when N goes 
to infinity) may be difficult to obtain. Note that this is nevertheless done in the 
symmetrical case in Flajolet and Huillet [7]. 

The Engset Process. This is one of the oldest stochastic models of communica- 
tion networks. See Engset [3]. For this model there are N sources of communication 
which are active (state 1) or inactive (state 0). An active source becomes inactive 
at rate /x. The total number of simultaneous active sources cannot exceed the quan- 
tity Cjv, the number of circuits of the network. An inactive source can therefore 
become active only if there are already strictly less Cjv active sources, in this case 
it occurs at rate v. If Xj\r(t) is the number of active communications at time t, 
when Cjv = N the process is just the Ehrcnfcst process. Otherwise (Xjv(t)) can be 
described as a reflected version of (E^(t)). In this context, an important quantity is 
the time it takes to have the full capacity of the network used, i.e. the hitting time 
of Cn by (Xjv(t)). If the equilibrium probability that Xn is CV is known, under 
the name of Engset Formula. To the best of our knowledge, results concerning this 
hitting time are quite rare, in particular for possible asymptotics when N goes to 
infinity. 

A Storage System. Another, more recent, motivation for considering (Xpf(t)) is 
the stochastic analysis of a storage system where files are duplicated on Cm servers. 
Each server breaks down independently at rate fi in which case it is repaired but 
all its files are lost. As a simplified model, Xjy(t) is defined as the number of copies 
of a specified file, if Xj^(t) = x then a copy of the file is lost if one of the servers 
breaks down, i.e. at rate x/i. If X^(t) — 0, there is no copy of the file in the system, 
it is lost so that is an absorbing point. If < x < Cm then a new copy of the file 
may be added but at rate N — x, N is the maximal capacity of duplication of the 
system. It is easily seen that as long as (-Xjv(t)) does not hit 0, pQv(i)) is precisely 
the Engset process. In this context it is of special interest to study the distribution 
of the first time when the file is lost, i.e. the hitting time of 0. See Feuillet and 
Robert [6]. 
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A Collection of Exponential Martingales. This paper relies heavily on the 
use of martingales to derive explicit, simple, expressions of the Laplace transforms 
of the hitting times of a state of the system. One obtains expressions of these 
transforms as ratios of simple integrals for which various asymptotic results, when 
N goes to infinity, can be derived quite easily with standard technical tools. In 
particular one does not need to cope with the asymptotic behavior of sums of 
combinatorial expressions. Quite surprisingly, up to now, martingales did not play 
a major role in the previous studies of the Ehrcnfcst process. One can mention 
Simatos and Tibi |20j where a martingale approach is used to estimate certain exit 
times for multi-dimensional Ehrcnfcst processes. It is one of the results of this paper 
to show that a simple and important family of martingales allows a quite detailed 
investigation of this process, and also of its variants like the Engset process. 

The key ingredient of this paper is a set of non-negative martingales which will 
be called exponential martingales. If (M(t)) is a martingale on some probability 
space, the associated exponential martingale is the solution (Z(t)) of the stochastic 
differential equation (SDE) 

dZ(t) = Z(t-)dM(t), t>0, 

where Y(t—) is the left limit of Y at t and dY(t) is the limit on the right of t of 
s i — ^ Y(s) — Y(t—). It is called the Doleans exponential of (M(t)). See Chapter IV 
of Rogers and Williams [TH] for example. Despite there is an exponential martingale 
for each martingale, a small subset of these martingales plays an important role. 
For the standard Brownian motion (B(t)) this is the martingale 

(exp (pB{t) - p 2 t/2)) , 

for a fixed (3 6 M. It is very helpful to derive the explicit expressions of Laplace 
transforms of hitting times associated to Brownian motion. See Revuz and Yor [17] . 
For jump processes, this is less clear. It does not seem that a "classification" 
of exponential martingales exists in general, even for birth and death processes. 
See Feinsilver [5] for related questions. Some examples of important processes are 
reviewed. 

For £ e R+, Af^ denotes a Poisson process with rate £ and (Af^,i) a sequence of 
i.i.d. such Poisson processes. All Poisson processes are assumed to be independent. 

Random Walks. The classical exponential martingale associated to the 
random walk (S(t)) = (Af x ([0,t}) - -V M ([0,*])) is given by, for (3 e K, 

(1) (exp [-/JS(t) - t (A (1 - e-P) + /z (l — e*))]) . 

It is the exponential martingale associated to the martingale 

(/3(S(t) - (A - . 

The corresponding reflected process is the M/M/l queue with input rate 
A and service rate /x. 
- The M/M/oo Process. This is a classical Markov process on N whose 
Q-matrix Q = (q(x, y)) is, for x £ N, q(x,x + l) = A and q N (x, x— 1) = fix. 
It can be also seen as a kind of discrete Ornstein-Uhlenbeck process, defined 
as the solution (L(t)) of the following SDE 

L(t-) 

dL(t) =Af x (dt)- J2 A^(dt). 
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The following martingale has been introduced in Fricker et al. [10], for 

/3 e E, 

(2) ((l+^^exp^e^A/M)). 

It is the exponential martingale associated to the martingale 

(l+fief) [M x (ds)-Xds] - £ J* _i_-l { , <L(s _ )} [^ i ( ds )- M d S ]^ . 

- The Ehrenfest Process. Such a process (E'jv(t)) with AT particles can be 
seen as the solution of the SDE 

N — L{t — ) £(*-) 

dL(t) = ^( dt ) - ^,<( d *)' 

i=l i=l 

as it will be seen the corresponding exponential martingale is given by, for 

PGR, 

(3) f (l - p^ +v)t j (l + pueO**) 1 . 

It is the exponential martingale associated to the martingale defined by, up 
to the multiplicative factor /3(fj,+v), 



^ +l/)s (l {yi(s _ )=1} [A^, l (d s )- M d s ]-l {yi(s _ )=0} [A^ 4 (d s )-^d s ]) 




where (Y t (t)) are such that E N (t) = Y\(t) H h Y N {t). Recall that the 

Engset process is a reflected version of this process. 

From these exponential martingales, explicit expressions of Laplace transforms of 
the distribution of hitting times associated to these processes can be derived. It 
may be not be as straightforward as in the case of Brownian motion since the space 
variable t is not separated from the space variable, but a convenient integration 
with respect to the free parameter f3 solves the problem. See Chapter 5 and 6 of 
Robert [18] for the M/M/l and M/M/oo processes and Section[3]for the Ehrenfest 
process. 

From the point of view of potential theory, these martingales are associated to 
the set of extreme harmonic functions. This statement can be made precise in terms 
of space-time Martin boundary. See Lamperti and Snell [16) and the discussion in 
Section |3J 

Organization of the paper. In Section [21 the two stochastic processes are de- 
fined precisely. In Section [3[ the exponential martingale for the Ehrenfest process 
is introduced, and, based on it, several interesting martingales for the Ehrenfest 
process and the Engset process are constructed. As a corollary, closed form expres- 
sions of the Laplace transform of the hitting time of a given state are obtained as 
the ratio of simple integrals. This holds in particular for the blocking time for the 
Engset process. The last three sections are devoted to the analysis of the asymp- 
totic behavior of the distribution of the hitting time of C/v and when A^ goes to 
infinity in such a way that Cn^^N for some < 77 < 1. Each section consider one 
of the three possible regimes: sub-critical when v < 77, the process "lives" in the 
neighborhood of vN , super- critical when the difference Cn — X^{t) converges to a 
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finite process, and, finally, critical when Cn — .Xjv(t) is of the order of yN . For each 
regime, by taking advantage of the simple expressions of the corresponding Laplace 
transforms obtained, various convergence in distribution results are derived. 

2. The Stochastic Model 

The Ehrenfest process. Let (Y(i)) be the simple Markov process on {0, 1} whose 
Q-matrix Qy is given by 

Qy=(-" " 

For N e N, if {Y^t)), 1 < i < N, are N independent copies of {Y(t}), the 
Ehrenfest process (2£jv(£)) is also a birth and death process but on the state space 
{0, 1, . . . , N}, it is defined as 

(4) E N (t)=Y 1 (t) + Y 2 (t) + --- + Y N (t). 

The Q-matrix of (-Ejv(i)) will be denoted as Qe n , for x £ {0, ... , A}, 

(5) q EN (x, x - 1 ) = fix and q EN (x, x + 1) = v(N - x). 

The Engset process. For 1 < Cn < A, the Engset process (Xjv(t)) is a birth and 
death process on {1, . . . , Cn} which can be seen as a reflected version of (Epf{t)) at 
the boundary Cn, i.e. its Q-matrix Qx N — (<Zjv(^, J/)) is given by, for < x < Cn, 

(6) Qx N (x, x — 1) = fix and qx N {x, x + I ) = z^(7V — a;) if x < Cn- 

In particular the process (X/v(i)) has the same distribution as the process (Eiy(t)) 
constrained to the state space {0, . . . , Cn}- In particular, when Cn — N, the two 
processes (Xjv(£)) and (Sjv(i)) starting from the same initial state have the same 
distribution. 

As ergodic birth and death processes, the Markov processes (Xjv(t)) and (Epf(t)) 
are reversible and their stationary distribution at x is, up to a normalization con- 
stant, given by 



x J \fl , 

if x is an element of their respective state space. 

Normalization of the time scale. By considering the time scale t — > t/(v + fi) 
in the analysis of the processes (Sjv(f)) and (X/v(i)), it can be assumed without 
any loss of generality that v + fi = 1. This will be the case in this paper. 

A limiting regime. In the following it will be assumed that the constant Cn is 
asymptotically of the order of N, i.e. that 

(7) ?7 d =" lim CV/iV, 

holds for some i] e (0, 1]. 

For t large the probability that the variable Y(t) defined above is at I is given 
by its equilibrium distribution at 1, that is v. The law of large numbers gives that 
2?jv(i) is of the order of Nv. Roughly speaking, if Nv < Cn for N large, i.e. v < rj, 
then the boundary at Cn should not play a significant role for first order quantities 
related to pQv(f)) and therefore the processes (A/v(i)) and (.Ejv(t)) should have 
the same behavior in the limit. On the contrary if v > 77, due to the reflecting 
boundary at Cn for (A0v(t)), the Ehrenfest and the Engset processes should behave 
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differently. This phenomenon will be stated more precisely in the last three sections 
of the paper. 

3. Positive Martingales 

Several families of positive martingales for the Ehrenfest and the Engset pro- 
cesses are introduced in this section. More specifically, when (Z(t)) is either (X]y(t)) 
or (E]y(t)), one identifies a set of functions / : N x R + \-t R + such that the process 
(f(Z(t),t) is a martingale, i.e. that, for t > 0, the relation 

E(f(Z(t),t) | F" t ) = f(Z(s),s), fors<i, 

holds almost surely, where .Ft) is the natural filtration associated to (Z(t)). 

If Qz = (lz(:, ■)) is the Q-matrix of (Z(t)), this probabilistic property is equiva- 
lent to the fact that the function / is space-time harmonic with respect to Qz, i-c. 
that the relation 

(8) ^- t f(x,t) + Q z (f(;t))(x)=0 

holds for ieN and t > 0, where for h : N i-> K + , 

Qz(h)(x) = ^2q(x,y)h(y). 

A space-time harmonic function of the Markov process (Z(t)) is just an harmonic 
function of the transient Markov process ((Z(t), t)). See Appendix B of Robert [18] 
for example. When (Z(t)) is (i?Ar(t)), one will prove that there is a family fp, 
j3 £ R of such functions. As it will be seen, these martingales can be interpreted as 
exponential martingales. They will give in particular an explicit expression for the 
Laplace transform of the hitting times associated to both processes (Eiy(t)) and 

(x N (t)). _ 

For a given birth and death process, there is already a complete description of all 
such positive martingales. This is the (space-time) Martin boundary of the birth 
and death process, see Lamperti and Snell [IS]. This description is, unfortunately, 
expressed in terms of the orthogonal polynomials associated to the birth and death 
process which are defined, in general, by induction, see Karlin and Mc Gregor 
As long as moments of some transient characteristics are investigated, these mar- 
tingales can be used but they are not, in general, really helpful to analyze the 
distributions of hitting times. 

This situation is quite classical, for Brownian motion for example, for which there 
is a family of martingales indexed by N £ N, if Hm is the Hermite polynomial of 
degree N, then (M N (t)) = (t N / 2 H N (B(t)/Vt)) is a martingale. Another family of 
martingales is provided by the exponential martingale (exp(j3B(t) — (3 2 t/2)) indexed 
by P £ R. This exponential martingale can be expressed as a weighted sum of the 
martingales (Mjv(£)), but to get explicit expressions of the distributions of hitting 
times, it is the really useful martingale. See Revuz and Yor [17] for example. In 
the case of birth and death processes, a general result concerning the construction 
of such exponential martingales from the martingales associated to the orthogonal 
polynomials does not seem to exist. 
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3.1. Exponential Martingales for the Ehrenfest Process. Due to the simple 
structure of the Ehrenfest process, these martingales are really elementary. Nev- 
ertheless they play a fundamental role, most of the asymptotic results obtained 
in this paper are based on these martingales. A more general version in a multi- 
dimensional context has been introduced by Simatos and Tibi |20j . 

Proposition 1 (Exponential martingales). For j3 € M., the process 

(9) (M&(t)) = ((1 - /W) £N(t) (1 + five*) N - E "®) 
is a martingale. 

Proof. Define, for N > 1, t > and < x < N, 

(10) h%(x, t) = (1 - /Wr(l + freY'*- 
Clearly, the relations 

-h^O, t)=Pu<*=-Qy(h*(; t))(0) and -h*(l, t)=-Piu*=-Q Y (h%(; t))(l), 

hold, where Qy is the Q-matrix of (Y(t)) introduced in Section [5J Consequently, 
the function hf is space-time harmonic for the matrix Q Y , equivalently (f(Y(t),t)) 
is a martingale. 

If ((Yi(t)),l < i < N) are N i.i.d. processes with the same distribution as 
(Y(t)), then, by using the independence of the processes (Yi(t)), i = 1,...,N and 
Equation (|4|, one gets that the process 

/(m*)) d = (K(t)) = (^ N (E N (t),t)) 

is also martingale with respect to the filtration (J r t )—(a<Yi(s), s < t, 1 < i < N>). 
In particular the function hP N is space-time harmonic with respect to Qe n - The 
proposition is proved. □ 

Martingales Associated to Orthogonal Polynomials. As remarked by Karlin 
and McGregor [11] [13] , the polynomials associated to the Ehrenfest process are the 
N + 1 Krawtchouk polynomials (/C„ , < n < N) defined by 



< n,x < N. 



1=0 

These polynomials are orthogonal with respect to the binomial distribution 



u G 



,i/ R u ,0 < k < N 
k ' 

The classical identity, see Karlin and McGregor [TT] for example, 

v 

( i2 ) s ' 

1=0 ^ ' 

and the above proposition give that 

' N 'N 



\n=0 
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is a martingale. As a consequence one gets that, for any < n < N, the process 
(Sjy(t)) exp(nt)) is a martingale. The martingale (M^-(i)) can thus be seen as 
an encoding of these (iV +1) martingales in the same way as the exponential mar- 
tingale of the Brownian motion with the Hermite polynomials, or the martingale ([2|) 
with the Poisson-Charlier polynomials. See Robert [18] . 

Note that the space variable E^(t) and the time variable t are not separated in 
Expression ([9]) of the exponential martingale. Provided that it can be used, Doob's 
optional stopping theorem applied to some hitting time of some specified state x 
does not give useful information on the distribution of this variable. But given that 
there is a free parameter j3 € R in Expression ((9|) and that the martingale property 
is clearly preserved by integration with respect to /3, one may try to find a measure 
on R + that will "separate" the space and time variables. The following proposition 
uses such a method. 

Proposition 2. For any a > and t > 0, if 

-1 / „ \ N-E N (t) 



(13) 



J?(t) = e-«< f\l u) N - B »M (l + ^u) EN{t) u a - 1 du, 



Tf N = inf{< > : E N {t) =x}, < x < N, 
then (I a (t A Tq N )) and (J a (t A TS N )) are martingales. 

Proof. Since (N — Ejs[(t)) is also an Ehrenfest process but with the two parameters 
fj, and v exchanged, one needs only to prove that the process (l a (t A T N )) is a 
martingale. 
Define 

/jv(*,i) d = r \ p N {x,t)p a ~ l dp, 
Jo 

where h N is defined by Equation (fT0|) . then 



^?(m)= r d -^{x,tw-up-—h% t ^{x,t). 

dt J at [i 

Note that the last term of the above expression is if x ^ 0. Consequently, for 
x ^ and t > 0, 

"7m 



-^-(X,t) + Q EN (fN)(x,t) 



dh 



N 



dt 



x,t) + Q EN (h' N ){x,t) 



p"- 1 d[3 = 0, 



because h 1 ^ is space-time harmonic with respect to Q e n as it has been seen in the 
proof of Proposition [T] In other words, the function /jv is space-time harmonic for 
the Q-matrix of the stopped process {E^{t A Tg N )), hence 

(f N (E N (t A T^) , t A T B ")) = (/^ (t A T B ")) 

is a martingale. The proposition is proved. □ 

It is now easy to get a representation of the Laplace transform of the hitting 
times for the Ehrenfest process. 



TRANSIENT BEHAVIOR OF EHRENFEST AND ENGSET PROCESSES 



9 



Proposition 3 (Laplace Transform of Hitting Times). For < x < y < N and if 

T^ N = M{t > : E N (t) = x}, the relations 



(14) E y (e-^ N ) = ^, and E. (e"^) - /} ( ( 

V I By(a) V ) D y (a) 

hold, with 



(15) 



1 / \ N-x 

B x (a)= I (l-u) x I l + -tt) u a - x du, 

f 1 



x(a) = y (1 - it)""* (l + £u) U Q - 1 I 



There is in fact only one result here since (JV — E^(t)) is, as it has already 
been remarked, an Ehrcnfcst process with the parameters fj, and v exchanged. The 
second relation of (|14[) is therefore a consequence of the first one. 



Proof. The martingale I a (tATo) is bounded and then uniformly integrable. There- 
fore, Doob's optional stopping Theorem gives the relation 

Ey(l a {0))=Ey{I a {T x )), 

the first relation of (ITU) follows. □ 



By expanding one of the terms of the integrand of Equation (|15p , ones gets 

bm) = yJ N : x ) (£) k J\i - ufu^ du 



^ \ k 

k=0 



N-x\ (v\ T(x + l)T(a + k) 



N-w 

k J Vm7 r(a + x + k + l) 

k=0 v / \r~/ \ i 

by using the relation between Beta functions and Gamma functions. See Whittaker 
and Watson J3JJ for example. Laplace transforms of hitting times can therefore also 
expressed as ratio of such sums, it is quite common for hitting times of birth and 
death processes. See Equation (4.4) of Karlin and McGregor (T2] for example. 
Flajolet and Huillet [7] uses this kind of representations in the symmetrical case. 
As it will be seen, from the compact representation (|15p with integrals, one will get 
asymptotic results for the distribution of these variables with standard techniques. 

3.2. Martingales for the Engset Process. It has been seen that the Engset 
process (Xjv(t)) is a reflected version of the process (.Ejv(i)) at the boundary Cn- 
The two families of martingales of Proposition [5] cannot be used directly if the 
sample path of (-Ejv(i)) may exceed Cn, when the hitting times of is analyzed for 
example. The idea is to construct a linear combination of the martingales (I a (t)) 
and (J a (t)) such that the space-time harmonicity of the corresponding function 
which is valid when the space variable in {1, ... , Cn — 1} holds also at the boundary 
Cn- This method has been used in Kennedy [IB] in the case of reflected random 
walks. 



Proposition 4. For a > 0, define 
(16) 



1 / \ N-Cn-1 

b N (a)=v\ (1 - u) Cn I 1 + ~u J u a du, 

! ■ ■■ - C N 



d N (a) = M / (1 - u)"- *- 1 (l + £u) 



u a du, 
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(K»(t)) = (d N (a)I»(t) + b N (a)j£(t)), 

where (1^ (t)) and (J^ (t)) are defined by Equations (|13[) with E^it) replaced by 
Xjsr(t) then, ifT Q N is the hitting time o/O by (X]y(t)), the process (K^ (tAT N )) 
is a martingale. 

Proof. Define the function g a such that, for alH > and < x < Cm, 



g a {x,t) = d N (a) e 



N-3 

1 + -it I u c 

A* 



(l-u) 

+ b N (a) e~ at 



du 



(l-u) N -*(l + ^u) 



The function g a is space-time harmonic for the matrix Qx N on {1) ■ • ■ > Cjv 
that is 

~dg a 



1}, 



dt 



+ Qx N {g a ) 



(x,t) = 0, < x < C Nl 



since the two matrices Qx N an d Qe n are identical as long as the starting point is 
in {1, . . . , C N - 1} and that [l£(t A T^ N )) and {J%{t A T^ N )) are martingales by 
Proposition [2l 

The space-time harmonicity of g a for the matrix Qe n at Cn < N gives the 
relation 



d 9a , n ( > 



For < y < 1, one has 



(CW) - _ W (JV-CW) [5a(C N + l,i)-<? a (CW)] 



(1-1/) 



Cjv+1 



l+-y 
A* 



N-Cn-X 

-(1-1/) 



-(i-y) Cw 



A 4 



N-Cs-l 



and, with a similar identity, one gets the relation 



-q£ + Qx N (g a ) 



(C N ,t) = {N Cn) (d N (a)b N (a) - b N (a)d N (a)) = 0. 
A 4 



The function g Q is space-time harmonic for the Q-matrix of the stopped process 



{X N (t A T,f ")), the process (i^(f A T Ajv )) is therefore a martingale 



□ 



Proposition 5 (Laplace Transform of Hitting Times for Engset Process). For 

< x < y < C N , ifT*« = inf{s > : X N (s) = x}, then, for a>0, 



(17) E x (e~ aT y 



Dx{a) 
D y {a) 



and E, 



dw(a)By(a) + b N (a)D y {a) 
d N (a)B x (a) + b N (a)D x (a) ' 



with the notations of Propositions \3[ and^ 



Proof. The first identity comes from the fact that the two processes (-Xjv(i)) and 
(Sjv(t)) starting from the same initial state are identical in distribution as long as 
they do not reach Cn- In particular, if Xjv(O) = £jv(0) = x, the variables T® N 
and Ty N have the same distribution. The second identity is a direct consequence 
of the martingale property of (K^ (t A T Q N )) proved in the above proposition. □ 
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4. A Fluid Picture 

This section gives a quick description of the first order properties of the Ehrenfest 
and Engset processes as N goes to infinity. Its purpose is mainly to introduce the 
three natural possible asymptotic regimes that will be investigated in detail in the 
last sections. The proofs of the asymptotic results are quite standard and therefore 
will be skipped. 

From now on, it is assumed that Relation (0 holds, that is 

lim C N /N = r)>0. 

The Engset process (Xjv(t)) can also be seen as the unique solution of the following 
stochastic differential equation 

N-X N (t-) x N (t-) 

dX N (t) = l {XN(t „ )<CN} Yl ^M d *)- E ^(d*)' 

i=l i=l 

starting from Xjv(O), where, for £ > 0, (Af^k) denotes an i.i.d. sequence of Poisson 
processes with rate £. The different Poisson processes used are independent. 
The initial state is assumed to satisfy 

lim X N (0)/N = x e [0,7?], 

N— f+oo 

then, by complementing Poisson processes in order to get martingales, the above 
equation can be rewritten as 

dX N (t) = dM N (t) + [v(N - X N (t))l { x Nit)<CN} - fiX N {t)} dt, 

where (Mjv(t)) is a martingale of the order of y~N. In the same way as for the 
Erlang process, see Chapter 6 of Robert [18] for example, one can prove the following 
convergence in distribution of processes 

lim (X N (t)/N) = (mm(r?, v + (x - v)e~ t )) . 

This first order description of the Engset process shows that there are three different 
asymptotic regimes. 

— Super- Critical Regime: v > t). 

Under this condition the renormalized process is at the boundary Cjv at 
time 

(18) r^log^-^/^-r?)). 

A more detailed picture can be obtained by looking at the process 

(Z N (t)) = (C N -X N {t/N)) 

of empty spaces with a "slow" time scale. As N goes to infinity, is is easily 
seen that the Q-matrix of this birth and death process converges to the 
Q-matrix of an ergodic M/M/l process with input rate r\ and service rate 
v. In particular, this gives the asymptotic expression of the Engset formula, 
for t G M+, 

lim F(X N (t) =C N ) = l- X]lv. 

A-S.+00 
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— Sub-Critical Regime: v < rj. 
In this case, one has in fact 

lim (X N (t)/N) = (v+ Oo - v)e~*) = lim (E N (t)/N) . 

N— >+oo N— >+oo 

As expected, the boundary at Cat does not play a role, for the first order 
the Engset process and the Ehrenfest process are identical. 

— Critical Regime: v = ?/. 

The fluid limit picture gives that the system saturates "at infinity" which 
is a too rough description of its evolution as it will be seen. 

The next sections are devoted to the asymptotic analysis of the distributions of 
hitting times. For the sake of simplicity, it is assumed that the initial state is on the 
boundary, either or Cat . Similar results could be obtained without any additional 
difficulty when the initial state is in the neighborhood of some [zN\ for < z < rj. 

5. Super-Critical Regime 

As it has been seen, under the condition v > r\ and at time t* defined by 
Equation (|18|) . the system is saturated for the fluid limit. It implies in particular 
that the hitting time of the boundary Cat, 

=mf{s>0- 1 X N (s) = C N } 

converges in distribution to t* . The following proposition gives a more precise 
asymptotic result. See Theorem 3 of Flajolet and Huillet [7] for a related result in 
the symmetrical case. 

Proposition 6. If Cn = T)N + 0(1), r) < v and Ajv(O) = 0, then the sequence of 
random variables 

(VN [T§Z-]og(y/(v-rj))]) 
converges in distribution to a centered normal random variable with variance 



y/r/(l - rfj/(v - rj) 
Proof. Proposition [3] gives the equation 



(19) E ( e - aVWT o N i = 



/o 




for a > 0. The integrand of the numerator of the right-hand side of the above 
equation can be expressed as exp(//v(u)) with 

f N (u) = N log (l - u/Vn) + (VNa - 1) logu. 

The function has a unique maximum at 

aVN - 1 1 + a 2 ( , r—\ 
UN = -7=-^ —n= = a — + o 1/VN) , 
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and 



/jv(yiv) = (alog(a) - a)ViV- — - log(a) + o(l), 

„ (1 + a 2 ) VN 

fAVN) = 2 +°(! ■ 

or a 

Laplace's method, see Flajolet and Sedgewick [8] for example, gives therefore the 
relation 



(20) 



//v 



3 fN{u} du , 



2tt 



y/-f"(VN) 
V / 2~7ra 



oSn{vn) 



^ exp f (a log(a) - a)\/iV - y - log(a)^ 



Similarly, the integrand of the denominator of the right-hand side of Equation ([19 
is exp(<7jv(w)) with 

ffJV (u) = (JV - C N ) log ^1 - + Cjv log (l + £-^=J + (VFa - l) logu. 



This concave function on the interval (0, v N) has a unique maximum located at 
z N = z - 5/VN + o{l/VN), with 

(y 2 + T) — 2r\v) + (y — r]) 2 



zq = a and 5 = v- 

v-T) {v-r))° 

and, with some calculations, one gets the following expansions 



gN(z N ) = [ -a + a log(a) + a log 



(2vr) — v 2 — r\) a 2 



- vY 



N 



- log(a) - log 



v — r\ 



and 



9n(zn) = — (I — 17 + 77 



(l-z/) 2 \ aVN-1 



o(l). 



By using again Laplace's method, this gives the relation 



(21) exp (-ay/Nlog — ^— \ 



e 9N{u) du 



V2 



exp (a log(a) - a) VN + 



[2vr\ -v 2 -r\) a 2 



o(l) 



(v-vY 



log(a) 



Equation (TTi?)) together with Relations (|2U)) and (j2"Tj) give finally 



to« (cx P (-av^V [t*» - log(i//(i/ - !?))])) 



exp 



77(1 — ?y) 
(iy-?7) 2 T 7 ' 



the proposition is proved. 



□ 
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An Informal Proof. The limit theorem obtained in Proposition [6] is a conse- 
quence of some detailed, annoying, but simple, calculations used to apply Laplace 
method. One can get quite quickly an idea of the possible limit with the help of the 
exponential martingale (M N (t)) of Proposition [T] through a non-rigorous deriva- 
tion. As it will be seen, it gives the correct answer but its justification seems to be 
difficult. The main problem comes from the fact that, in this martingale, the term 
e* stopped at some random time may not be integrable at all. For example, it is 
easily seen that the first jump of the martingale (M-f (t)) is not a regular stopping 
time for this martingale, i.e. the optional stopping stopping theorem is not valid 
for this stopping time. 

Denote Z N = \/N (exp(T*£) — exp(i*)^ , where f* is, as before, \og(v/(v — rj)). 
By using the martingale © of Proposition Q] by assuming that the stopping time 
is regular for it, one gets 

Cn / v \ N—Cn 



E 



1 - \ie c N /3/VAT ) [l + ve c « 0/VN\ = {1 + v0/Vn\ 



this equation can be written as E (exp(£//v)) = 1, with 
U N = C N log ( 1 - ne T *N 0/VN 



+ (N- C N ) log ^1 + ve T *N — N log (l + v0/Vn} 



hence. 



U N = -P{n - v)Zn - ((?/M 2 + (1 - vV) e 2 ** - /3 2 /2 + o (l/N) 

= - v)Z N - U -^—^- 2 /2 + o (l/N) , 



provided that the limit can be taken under the integral, one gets finally 

' f3 2 v 2 r\(\ - r,)" 



lim E (e~' 3Zjv ) = exp . 
Ar->+oo v ; \2 (v-r\) 



Expressed as a limit theorem for , this is precisely the above proposition. 

6. Sub-Critical Regime 

It is assumed in this section that v < r, so that the Ehrenfest process "lives" 
in the interior of the state space, the hitting time of the boundary Cn should be 
therefore quite large. The following propositions give asymptotic results concerning 
this phenomenon. 

The first result concerns the time it takes to the Ehrenfest process to have all 
particles in one box when, initially, they are all in the other box. This is of course 
a very natural quantity for this process. In the discrete time case, representations 
of the average of this quantity have been obtained in a symmetrical setting. See 
Bingham 1 and references therein. 

Proposition 7. If v < 1, Xn(0) = and Cn = N, then the sequence of ran- 
dom variables (^Niy N T^^j converges in distribution to an exponentially distributed 
random variable with parameter 1 — v . 
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Proof. One uses Equation (fT4| of Proposition [3] to get that, for N > 1, 

(22) E( e - a « T « N ) = ^(l-ufu^- 1 du I j' (l + ^u) N u^- 1 du, 

with a 7v = aNu N , for some a > 0. 

The numerator of this expression can be written as, after an integration by parts, 



l r i 

\N..a N -l i . _ / at/-, ..\N-1 



a ,v 



(l-u)"^"- 1 d M = / ^(l-u)"- 1 du 

o Jo a N 

f N / 71, \ JV-1 ,.«JV 1 

= N- a » / ?^du~— , 

Jo V ^ y "JV "TV 

by Lebesgue's Theorem. 

By subtracting 1/ckjv from the denominator of the right-hand side of Equa- 
tion (|2"2"|) . one gets 

JoVVi^/ J v J \ v J a N 

1 f N / u\ N ^ l-(l-u/N) a * 



/ u yv-j. i - ( i - u/iV r jv , 

1 - du, 

V N J aN 



hence, 

a N f N ( u 1 - (1 -u/N) at 



a N /N 



du 



o 

f +ao _ a 
~ a ue ^ u u du = — . 

Jo M 

These two asymptotic results plugged into Equation (|22|) give the desired conver- 
gence in distribution. 

□ 

Theorem 2 of Bingham [X, provides a similar result in the symmetrical case /i = v 
and in discrete time. In the present case, there is an additional factor N in the 
scaling of T c N which is due to the fact that the continuous time dynamics are TV 
times faster than the discrete time case. 

Proposition 8. If Cn = r]N + 0(1), v < 77 < 1 and Xn{0) = 0, then if 

1 — 77 s 



H=(l-r,)log(—^j+ V log(^), 
the sequence of random variables 



(23) I , v " - ^- VNe-»» T Cn 



(rj — v)^/2tt 

converges in distribution to an exponentially distributed random variable with pa- 
rameter 1. 

One remarks that the exponential decay factor H of the above proposition is in 
fact a relative entropy of Bernoulli random variables with respective parameters rj 
and v. Despite similar "entropy" expressions appeared at several occasions in the 
study of these processes, we have not been able to find a simple explanation for the 
occurrences of these constants. 
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Proof. For a > 0, denote by oat the product of a and the coefficient of T c ™ in 
Expression (|23p . Equation (|14l) of Proposition [3] is again used 
(24) 



E 



ajv — 1 



du 



The asymptotic behavior of the numerator of this Laplace transform has already 
been obtained in the proof of the above proposition. 

To study the denominator, one will proceed as before. For u £ [0, 1], denote 

f N (u) = (N- C N ) log(l - u) + C N log (1 + fiu/v) , 

this function has a unique maximum at 



y d ^ Cn/N - V = ^-4 + Od/N) 



1 - v 



(1-u) 



which is given by 
and 



(1 - rj) log 



1-u 



?ylog (- 



AT + (1), 



T)(l - Tj) 

The denominator of Laplace Transform (j2"4")l is 



^iv-l du = 









e /jv(«) _ ^ 


/o 





,ajv — 1 



1 v lni aN — Ti a N\ 1 



OiM OiN 

by integration by parts. The integral In of the right-hand side of this relation can 
be written as 



In = 
hence, 



(1-2/0 )vW 



1 



-yoVN 



In = ?y " 



f' N [vo + 



z M Vn+ u/VN) (vo N - (w> + uHny») 

OIN 



du, 



,-1 -/^fao) f N (y ) 
jV 3 / 2 



+00 



2 

u exp 



f_M u 2 

N 2 



du + o(l/N) 



2tt 



„ w f -77 M ^^)+ (l/iV). 

By gathering these asymptotic results one gets the relation 

lim E fexp (-a N T^ N )) = 1/(1 + a), 

N— »+oo \ \ / / 

the proposition is proved. 



□ 



Proposition 9 (Hitting time of the empty state). Under the condition v < n and 
if Cn = T]N + 0(1) for rj > and Xn(0) — Cm , then the sequence of variables 



TRANSIENT BEHAVIOR OF EHRENFEST AND ENGSET PROCESSES 



17 



converges in distribution to an exponential random variable with parameter v. 

Not that this result can be, informally, justified with the result of Proposition [7J 
Without the boundary Cat, one could obtain the result by exchanging \i and v and 
by using Proposition [71 This result shows in particular that the boundary does not 
change the limiting behavior of T Xn in the sub-critical regime. 

Proof. Denote chn = N(l — v) . Proposition [5] gives the relation 

d N {a N )B CN {a N ) + b N (a N )D CN (a N ) 



(25) E Cn (e-^ T o XN ) 



d N (a)B (a N ) + b N (a N )D (a N ) 
One starts with the asymptotic behavior of (djv(ajv)), 

Cjv 



. VW / \ N-C N -l / x Cjv 



NJo V VNJ V v^/N 

+ 00 



du + o{l/VN\ 



The other coefficient 6/v(c*a0 is such that 

b N {a N ) = vJ (1 - u) Cn (l + -u\ u aN du 

cN , . n „, / .. \ N-O n -1 



o(l/JV). 



V 

N 

v(l - v) 1 



v — T] N 

The proof of Proposition [7J provides the following relations 



D (a N ) 

and 



/ {l-u) N u aif - 1 du - — , 

JO a N 



1 1 



B {a N )= l + u"^ 1 d« 



/x J ~ a N ' vN{l-v) N ' 

The two remaining terms to estimate are 

r 1 ( v \ n ~ Cn 

B CN (a N ) = J (l-uf N (! + -«) u""- 1 dw, 

£>c„ M = f\l - u) n ' Cn (l + ^u) CN d 



u. 



With the same method as in the proof of Proposition [H one can show that the 
quantities Bc N (ajv) and Dc N can be written as 1/o>n + o (-Bo(ow) — 1/ajv))- 
More informally, the term (1 — u) under the integral for these two expressions 
reduces by an exponential factor their asymptotic behavior. 
These various estimations give finally that 

lim E(e— o-)= hm B C,M _ 1 



N^+oo \ / W-y+oo Bo (a at) 1 + Ct/v 

The proposition is proved. □ 
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7. Critical Regime 

In this section, it is assumed that Cn ~ vN, if Xn(0) — 0, the fluid limit 
of the process is given by v(l — exp(— t)), the fluid boundary v is reach at time 
t = +oo. In fact, with a second order description, the process Xjsr(t) can be 
written as Xjv(i) ~ v{\ — exp(—t))N + Y(t)V~N for some ergodic diffusion process 
(Y(t)), so that the hitting time T^ N of the boundary is such that 



nX 

/ _ v , \ ! y 1 

exp 



{- T c N N ) 



Cn 



which gives a rough estimation T C N ~ log(vTV). The following proposition shows 
that this approximation is fact quite precise. See also Theorem 4 of Flajolet and 
Huillet ffl. 



Proposition 10. IfC N = vN + 5VN + o{VN) with u<l,SeR, and X N (0) = 0, 
then the sequence of random variables 



(t*£ - log(AT)/2 



converges in distribution to a random variable Z on R whose Laplace transform at 
a > is given by, 

(26) E (e-*) = r(a) / jf + °° exp - y 

If S = 0, then the variable Z — log {v/{l — v)) /2 has the following density on R, 

a; h-> \/2/7rcxp (-x - e _2x /2) . 
As usual r refers to the Gamma function, 



r(a) = / e""^- 1 du, a > 0, 
Jo 

see Whittaker and Watson for example. Note that the Laplace transform of 
the limit in distribution is the ratio of the Mellin transforms of the functions 

( 8 u 2 {l- v ) 
u i— > exp(— it) and u i— > exp — u 



v 2 

See Flajolet and Sedgewick [8] on Mellin transforms. 
Proof. Proposition Ogives the equation 

Eo (V<v™ \ = jf 1 (l - u f u «-i d^ I' (1 - (l + ) n"" 1 du, 

for a > 0. The asymptotic behavior of the numerator is easy since 

-1 1 rN , „ jv 



V ' N a J \ NJ 



7V Q 



The denominator can be expressed as 

1 
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f N (u) = (N-C N )log[ 1--^=) +CWlog(l + ^^L 



with 



(1 - v) u 2 5 ,„ , /— , 

= -— + -u + o(l/VN). 

v 2 v 

By using Lebesgue's Theorem, one gets therefore that, for a > 0, 

.. m / -a [T^ -log AT/21 \ „, , / / 1+0 ° / (1-^)U 2 , <5 , . 

hm E e L c n 1 J = r(a) / / exp V -u ) u 1 du 

w^+oo y j I J \ v 2 v ' 

and hence the first part of the proposition. 
Assume that 8 — 0, a change of variable gives 

+°° / -a (l- v )\ „ , , 1 / 2*/ ^ Q/2 



The Laplace transform of Z can therefore be expressed as 

by using Legendre's duplication Formula for Gamma functions. See Whittaker and 
Watson [21] page 240. Since 

— =r —I— \= exp (a log(2«) /2 - log(w) /2 - u) du 



'7T 



/" + e~ a " exp (-u - e- 2u /2) du, 

V J — oo 



with a change of variables, one gets the desired result on the distribution of Z. The 
proposition is proved. □ 

One concludes with the hitting time of empty state, one can remark that, at the 
correct time scale, the time is half of the corresponding variable in the sub-critical 
case. See Proposition |H1 A simple, naive, explanation is as follows. For sub-critical 
regime the process lives in a region centered at vN and whose width is of the order 
of y/~N and therefore makes many excursions in this region before reaching Cn ■ In 
the critical case the process lives near, but only on one side of vN . In particular it 
cannot go above Cn and, consequently, does not waste time on such excursions. 



Proposition 11 (Hitting Time of Empty State). IfC N = vN + o{\/N) with v > 
and Xn{0) — Cn , then the sequence of random variables 

(N(l - u) N TfA 

converges in distribution to an exponential random variable with parameter 2u. 
Proof. Denote = N(l — v) N . Recall that 

„- aN T?"\ _ d N {a)B CN (a) + b N (a)D CN (a) 



e 



d N (a)B Q (a) + b N (a)D (a) 
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One starts with the asymptotic behavior of (cIn(oin)), by definition 

u 

v 



d N (a N )=pt j (1 - u) n - Cn - 1 (l + u aN du 



/N Jo \ VNJ \ "VN. 

Since r\ = v and [i = 1 — v, the relation v{l — rj) — [ir\ holds and therefore 



d 



JV 



Note that, up to a term —1 in an exponent which does not play a role in the 
limiting behavior, the quantity 6j\r(a) is almost djv(a) with v replaced by (1 — v). 
Consequently &Ar(a) has the same asymptotic expansion as d/v(a). 

The asymptotic behaviors of the quantities Bo(q:n)i Do(a]\[), Bc n {un) and 
Dc N (otN) are the same as the ones obtained in the proof of Proposition [5] By 
gathering these various estimations one gets that 



lim E(exp(-a N T* N )) =2/{2 + a/v), 

J— f+oo \ \ / / 



N-S.+CO 

the proposition is proved. □ 
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